rank | frequency | n-gram |
---|---|---|
1 | 2369 | -a |
2 | 1229 | -i |
3 | 664 | -e |
4 | 555 | -u |
5 | 465 | -o |
rank | frequency | n-gram |
---|---|---|
1 | 442 | -ga |
2 | 397 | -ia |
3 | 253 | -na |
4 | 209 | -ra |
5 | 199 | -ua |
rank | frequency | n-gram |
---|---|---|
1 | 432 | -nga |
2 | 112 | -tia |
3 | 112 | -ana |
4 | 97 | -hia |
5 | 95 | -ngi |
rank | frequency | n-gram |
---|---|---|
1 | 239 | -anga |
2 | 88 | -angi |
3 | 57 | -inga |
4 | 44 | -tahi |
5 | 43 | -enga |
rank | frequency | n-gram |
---|---|---|
1 | 111 | -tanga |
2 | 76 | -rangi |
3 | 44 | -ranga |
4 | 37 | -hanga |
5 | 21 | -whiti |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings